The Architecture of Idiotypic Networks: Percolation and Scaling 

Behaviour 



Markus Brede and Ulrich Behn 

Institut fur Theoretische Physik, Universitat Leipzig, Augustusplatz 10, D- 04109 Leipzig, 

Germany 



Abstract 

We investigate a model where idiotypes (characterizing B-lymphocytes and 
antibodies of an immune system) and anti-idiotypes are represented by com- 
plementary bitstrings of a given length d allowing for a number of mismatches 
(matching rules). In this model, the vertices of the hypercube in dimension d 
represent the potential repertoire of idiotypes. A random set of (with proba- 
bility p) occupied vertices corresponds to the expressed repertoire of idiotypes 
at a given moment. Vertices of this set linked by the above matching rules 
build random clusters. We give a structural and statistical characterization 
of these clusters - or in other words - of the architecture of the idiotypic 
network. Increasing the probability p one finds at a critical p a percolation 
transition where for the first time a large connected graph occures with prob- 
ability one. Increasing p further, there is a second transition above which the 
repertoire is complete in the sense that any newly introduced idiotype finds 
a complementary anti-idiotype. We introduce structural characteristics such 
as the mass distributions and the fragmentation rate for random clusters, 
and determine the scaling behaviour of the cluster size distribution near the 
percolation transition, including finite size corrections. We find that slightly 
above the percolation transition the large connected cluster (the central part 
of the idiotypic network) consists typically of one highly connected part and a 
number of weakly connected constituents and coexists with a number of small, 
isolated clusters. This is in accordance with the picture of a central and a 
peripheral part of the idiotypic network and gives some support to idealized 
architectures of the central part used in recent dynamical mean field models. 

PACS numbers: 64.60.Ak, 05.10.Ln, 02.70.Lq, 87.18.-h 
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I. INTRODUCTION 



B-lymphocytes carry on their surface highly specific receptors, so-called antibodies. If 
these receptors detect complementary structures, the lymphocyte is stimulated to proliferate 
and after several generations and differentation into plasma cells secretes antibodies of the 
same specificity. B-lymphocytes and antibodies of a given specificity are said to have a cer- 
tain idiotype. Complementary structures to an idiotype are antigen or other, anti-idiotypic 
antibodies. Between B-lymphocytes of different idiotype thus emerges a functional network 
of mutual stimulation and inhibition, the idiotypic network JT|. The idiotypic network is 
supposed to at least partially contribute to the functionality of the immune system as, e.g., 
immunological memory or suppression of autoreactive clones. Though quantitative data are 
very hard to access by experiment there are some recent observations which underline the 
importance of idiotypic interactions [Q.Q . 

New idiotypes are produced in the bone marrow or due to hypermutation during the 
proliferation of stimulated lymphocytes which introduces a random metadynamics of the 
repertoire. At a given moment the random network has a certain architecture. The aim of 
this paper is to give a statistical description of this architecture. Knowledge of the typical 
architecture of the idiotypic network is crucial for describing the population dynamics of the 
interacting B-lymphocytes and antibodies, cf. @||, which is however not the subject of this 
paper. 

Derived from hypotheses of theoretical immunology (cf. jjjjj), for a recent review see [0) 
we present a statistical analysis of bit-string based networks, which shows, that this approach 
is well suited to reproduce reasonable network structures. Especially, our considerations 
show, that realistic network topologies can be conceived as an extension of first approaches 
to that problem which assumed a Bethe-lattice structure. Suggestions of how loops should 
be added to such structures have been made previously [EJ. For this, our work provides a 
very natural access. 

Generally, idiotypic networks are supposed to realize a tradeoff between two basic re- 
quirements: they should contain a great number of small isolated components, but on the 
other hand still be able to respond to arbitrary antigen, that means being complete. Small 
components are thought necessary to store information about previously encountered anti- 
gen |7|,|9|JT0H . The existence of such components does obviously demand a low connectivity 
of the network. With completeness on the other hand it is assumed that of a great number 
of antibodies each is able to detect many different types of antigen. Hence the network 
connectivity should not be too low. 

With regard to the underlying biological problem the one great-many small clusters 
situation is thus worthwhile to earn special attention. Theoretical immunologists suppose 
that the idiotypic network should consist of a large number of small clusters. On the other 
hand, as a consequence of relatively high connectivities, also a large component should be 
contained within the idiotypic network, which is denoted as its central part. This great 
cluster could play an important role in the control of autoreactive clones T3|. 



In the following the bit-string model of []nj will be explained briefly. Basically, antibodies 
are identified with bit chains of a given length d. Thus, there are 2 d different antibody types. 
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The set of all conceivable antibodies, i.e. the potential repertoire, is then represented by 
{i = (ii, ...,i d ),ij G {0, 1}}. Estimating the probable size of networks that can be complete 
in the above sense shows that d ~ 32... 36 should be a good value for realistic models ]15| . 



Antibodies recognize each other if they have complementary structures, i.e. if they are 
represented by perfectly complementary bit strings. If there are small deviations from the 
exact structural complementarity a matching is still possible, though with lower affinity 
This is described by so-called matching rules. 

For example we imagine that antibodies react if the respective structures are complemen- 
tary except one small region. Using the language of our model the corresponding bit-strings 
should be inverse except one bit that belongs to the non-complementary region. We call this 
kind of rule a one-mismatch rule. Two-mismatch rules, which allow reactions between anti- 
bodies that are complementary up to two non-matching regions, are accordingly defined by 
connecting bit-strings that are complementary with two exceptions allowed. Naturally, the 
multitude of possible rules is not exhausted by mismatch rules. Rules that express matching 
of mutually shifted antibody parts could be conceived as well. For the sake of simplicity 
this work is confined to those matching rules that are associated with reactions of highest 
affinity, i.e. inversion, one- and two-mismatch rules. 

Mathematically, the set of all antibody types (represented by bit-strings or vertices of 
a hypercube) together with all possible reactions between them defines a graph. Thus, we 
want to call 

G d —({v = (v i, Vd), Vj G {0, 1}}, {v connected with w 
if Vi = Hi] for all i,j = 1, d except one position 
at the most}) (1) 

the one-mismatch base graph and 

G 2 d —({v = {y i, Vd), Vj G {0, 1}}, {v connected with w 
if Vi = Wj for all i,j = 1, d except two positions 
at the most}) (2) 

the two-mismatch base graph, respectively. To obtain a better impression of one-mismatch 
graphs they can be compared to hypercubes of the same dimension. Looking, e.g., on 
hypercubes of dimension d = 3 edges of one-mismatch graphs are represented by all space 
and side diagonals. This, together with a picture of an one-mismatch graph in dimension 4 
is shown in Figs. [TJ and |2|, respectively. 

The actual repertoire, i.e. the set of all types of idiotypes that are really existing within 
the body at a given time, is a subset of the potential repertoire. There are profound reasons 
to believe that the actual repertoire should be distributed randomly within the potential 
repertoire f7j. A way of realizing this is to choose antibodies with a certain probability 
p. Then the idiotypic network is represented by the random graph that is composed of all 
occupied vertices (antibodies) and their connections (possible reactions). Clearly, this is a 
site percolation problem JIE| . 

Percolation problems on standard lattices are far away from beeing a new field of research. 
Percolation on one-mismatch graphs, however, bears some major difference to ordinary 
percolation. 
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Percolation is always connected to an abrupt change of some system property (the stan- 
dard property is the existence of a connecting path from the upper to the lower boundary on 
2-dimensional lattices) if the occupation probability passes a certain value, the percolation 
threshold. This transition becomes sharp in the limit of infinite systems. 

On lattices there is no question how to increase the system size to infinity. However, 
sequences of one-mismatch graphs exhibit relations between small and large systems that 
are essentially different from similar characteristics of lattices which are created by the 
multiplication of a unit cell. Understanding of these distinctions comes from studying the 
structure of fully occupied one-mismatch graphs (base graphs) which will be performed in 
the ensuing Section. 

In this paper we investigate the architecture of functional networks built by constituents 
of randomly generated characteristics interacting with complementary constituents. Our 
motivation, the formulation of the problem and the interpretation of the results are all 
in the context of modeling the immune system using language and methods of statistical 
physics. However, our results are essentially independent of the specific immunological inter- 
pretation and could be of broader interest. The situation of interacting randomly generated 
complementary constituents is quite general. For example think of chemical reactions (origin 
of life), ecosystems, social networks. 

Generally, the conception of the paper is as follows: in Sec. II topological properties 
of base-graphs will be studied. Section III gives an analysis of the underlying percolation 
problem, which then leads, together with applications of random graph theory from Sec. IV 
and a study of intrinsic structures of great clusters in Sec. V, to conclusions about parameter 
regimes, in which bit-string model induced random graphs resemble idiotypic networks. In 
the appendix another method to calculate thresholds via the renormalization of small cells 
will be discussed. 



II. STRUCTURE OF THE BASE GRAPH FOR THE ONE-MISMATCH RULE 

The above definition of matching rules allows an easy calculation of distances between 
vertices of G\. Let i,j G G\ be vertices, du their Hamming distance (i.e. the number of 
different bits between them) and 

d G (i,j) = inf l(w) , (3) 

pathes w in 
connecting i and j 



where l(w) denotes the length of a path w, a metrics. Then it holds that if 
d H < d/2 d G -- 



du for du even 

d H + 1 for d H odd 
d H — d/2 d G — d/2 for d even (4) 

d — du + 1 for d — du even 
d — d H for d — du odd , 



d H > d/2 d 



G 



which shows that the maximum distance D m£LX (d) between vertices is roughly the half of 
that on ordinary hypercubes. This is essentially due to the inversion rule. Furthermore, it 
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should be noticed that D max (d) does not change when increasing the dimension by one from 
odd (d = 2n + 1) to even (d = 2n + 2) values. 

To obtain additional information about the toplogy we distinguish vertices corresponding 
to their distance s to an arbitrarily chosen origin, e.g. v = = (0, ...,0), whose choice is 
arbitrary due to the specific construction-rule of base graphs. We denote the set of vertices 
with distance dc(v,i) from v by 

E 8 = {ieG 1 d \d G (v,i) = s}. (5) 

Applying (|j) it is possible to compute the number of vertices \E S \ belonging to E s (mass 
distribution) by simple combinatorics to 

if d G = 

E, { . (6) 

if d G < I 

A visualization of Eq. (^j) in Fig. [3] shows a major difference between odd and even dimen- 
sions d. Generally, \E S \ grows monotonically with increasing distance. For odd dimensions, 
however, the number of vertices with maximal distance D max from v is substantially smaller 
than that with distance D m&x — 1 

Differences in the overall structure of graphs in odd and even dimensions d become even 
more obvious if the number of links connecting vertices of the same set E s is considered. 
Using (f|) , a somewhat lengthy but straightforward calculation shows that connected vertices 
in the same distance s from the origin v occur in 

d even only for s = D max — d/2 

d odd only for s = D max = (d+ l)/2 or (7) 

Analogously, the number of links that connect vertices of E s with vertices belonging to E s+ i 
may be computed to d + 1 — s for s < D max — 1 . 

As a consequence there must exist loops of even as well as of odd length. Moreover, all 
short loops with length less than d must be even (since only loops comprising links between 
vertices in the same distance from any loop element can be of an odd length). 

An important property of standard systems in percolation theory is that the smaller 
system is always contained within the larger one. For one-mismatch graphs, however, this is 
not possible. Let d\ < d 2 be dimensions of G\ and G\ 2 , respectively. If G\ were contained 
in G^ 2 it would follow directly that G\ 2 should have uneven loops with length less then d\. 
This gives a contradiction since G\ 2 contains no odd loops smaller than d 2 , whereas d\ < d 2 
by assumption. 

Resuming these results we state that percolation on one mismatch graphs differs from 
standard percolation. A crucial point for this distinction is the way how global system prop- 
erties change if the system size approaches infinity. Nevertheless, methods of percolation 
theory can be applied to this kind of problem. Even more, choosing an appropriate ma- 
jority rule renormalization group procedures can be extended to one-mismatch-graphs (see 
appendix A). 
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To the best of our present knowledge percolation problems on graphs with similar prop- 
erties have not been dealt with. Even percolation on regular hypercubes has found relatively 
little attention in the physical literature WJ§ . 



III. CLUSTER SIZE DISTRIBUTION 

Having collected some simple properties of the underlying base graphs we consider now 
vertices of that graphs occupied with a given probability p. The set of all occupied vertices 
T together with all bonds, which connect two vertices belonging to T forms a random graph. 
In terms of percolation theory a maximal set of connected vertices is called cluster. 

What is the probability that an arbitrarily chosen vertex belongs to a cluster of size |C|? 
This question has already been addressed in |],[nj where three major regimes of the system 
have been identified (see also the numerical results in Fig. |j): 

(i) The first typical situation arises for small values of the occupation probability p. 
Then only small clusters (whose individual size doesn't make up a finite fraction of the 
whole system) are expected to appear. 

(ii) Increasing p and approaching infinite dimensions, a sharp transition (in the sequel 
denoted as percolation transition) to an one great cluster-many small clusters regime was 
found. At the percolation transition some characteristics obey scaling laws. These and an 
approximation to compute percolation thresholds will be dealt with in Sec. IV. 

The one great-many small clusters situation is worthwhile to earn special attention (cf. 
Sec. I). The great cluster could play an important role in the control of autoreactive clones. 



To fulfill that purpose it is believed to have a certain internal structure (cf. [|T^,|T^,0, for 
experimental data cf. jn|), which will be discussed in more detail in Sec. VI. Thus, as stated 
already in 0,|TUJ the simple one-parametric bit chain model exhibits an interesting similarity 
to the idiotypic network. 

(iii) Finally, if the occupation probability is further increased, a state where random 
graphs consist of one connected component only will be reached. Relying on some general 
results of the theory of random graphs it can be shown that this indeed marks a second (in 
the limit of infinite systems) sharp transition which we call denseness transition. In Sec. V 
these considerations are resumed and discussed with reference to the completeness of the 
immune system. 

Exactly the same two thresholds have been considered earlier for a different class of 
random graphs PO,PT| and more recently in the context of networks of RNA secondary 



structures lf22LE3 



IV. SCALING LAWS AND BETHE APPROXIMATION 

A common method to analyze percolation problems is the introduction of perimeter 
polynomials D\c\. Using the standard notation we define 

p(\C\)=P lcl (l-p) lSiM =P lCl D\c\(l ~P), (8) 

where p(|C|) denotes the probability that a vertex on the base graph belongs to a cluster of 
size \C\. Sf rce (C) means the free surface, i.e. the set of all unoccupied vertices of the base 
graph that are adjacent to vertices of C. 
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Generally, there are few problems which allow the explicit calculation of D\c\ for arbitrary 
cluster size \C\. Notwithstanding, it is always possible to compute D\c\ for small values of 
\C\. This provides a basis for the application of series expansion techniques. We investigate 
the structure of small clusters up to size \C\ = 5 on one- mismatch-graphs. As a result for 
large base graphs (d > 5) we find the following perimeter polynomials for one-mismatch- 
graphs 

DM = q d+1 , (9) 
D 2 (q) = (d+l)q 2d , (10) 

D 3 (q) = ^(d+l)dq sd -\ (11) 
D,{q) = q 4d - 4 (2(d - l)d(d + 1) + ^±12 + 

+^(d-l)d(d+l)) , (12) 

q 

D 5 (q) = ^q 5d - 6 (d - l)d(d +l)(d-2 + -(d-l) + 
l q 

7f + ^ [d - 2)) ' (13) 

where q = 1 — p. 

In the following we show that in the limit of high dimensions random graphs on one- 
mismatch-graphs for small values of p are very similar to random graphs on Bethe-lattices 
with the same coordination number. Arguments of this kind have already been applied to 



percolation on hypercubic lattices |25| making use of the fact that loops in random graphs 
on sparsely occupied high dimensional lattices are infrequent. Indeed, the first terms of a 
high dimension expansion on a hypercubic lattice of dimension d are given by the exact 
values of the percolation threshold for the Bethe-lattice with coordination number d. 

A closer look at the perimeter polynomials (|9-13) supports the hypothesis that the sit- 



uation is quite similar for one-mismatch-graphs. For example, the contribution from loops 
to p(A) = p 4 _D 4 (g) is given by p loo P(4) = p A q 4d ~ A d(d + l)/2. Obviously, the relative weight 
of this term vanishes in the high dimension limit. Additional support comes from the ob- 
servation that the one- mismatch base-graph G\ contains hypercubes up to dimension d—1. 
To elucidate this property we consider the following example in d — 4. We construct a 
hypercube of dimension 3 applying the one-mismatch rule allowing for a mismatch in one of 
the, e.g., last three bits. Starting for instance at the origin (0, 0, 0, 0) this yields (1, 1, 1, 0), 
(1, 0, 1, 1), 1, 1, 0, 1). Iteration gives the new vertices (0, 1, 0, 1), (0, 0, 1, 1), (0, 1, 1, 0), and 
finally (1, 0, 0, 0). These 8 vertices are connected by the matching rules like a 3-dimensional 



hypercube. Then follow the arguments of f24],|2"5| 



Consequently the percolation threshold for one-mismatch-graphs can be approximated 

to 

pW = 1/d, (14) 

whereas for two-mismatch graphs holds 

pf) = l/(d + d(d+l)/2). (15) 
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Corrections to ( |i4|) and (|15D are of order 0(d~ 2 ) and 0(d~ 3 ), respectively. From consider- 
ations of Sec. I we find that the number of links connecting vertices in E s with vertices in 
E s+ i decreases with growing distance s. Thus, we conclude that the corrections to p^ must 
be positive. 

Typically for percolation problems, certain system characteristics obey scaling laws at 
the percolation threshold. This kind of laws reflects the statistical self similarity of clusters 
on different length scales in that parameter regime. Fig. |^ displays simulation results for 
the cluster size distributions obtained for one-mismatch-graphs of dimensions 10 and 14 . 
Both data sets can be well described by the finite size scaling ansatz 

P (\c\) = \crF(\c\/\cn, (i6) 

where the function F(x) is nearly a constant for i C 1 and will be more rapidly declining 
than any power law for arguments x ^> 1. Equation (11(3) should apply for clusters which 



are neither too small nor too large. Due to the limited size of the investigated systems, 
very large clusters behave in a different way than clusters of 'standard' size. The transition 
between the laws applying to these separate cases is marked by \C\* , which in turn depends 
on the extent of the system. 

Analogous properties can be observed for base graphs defined by other matching rules. 
Figure || illustrates scaling behaviour for the case of graphs diced on two-mismatch-base 
graphs for values of p = pf^ cf. ([15|). As a natural consequence of adding supplementary 
edges distances between vertices will become smaller in comparison to one-mismatch-graphs. 
This gives an explanation for the fact, that typical sizes \C\* are substantially smaller than 
those examined on one-mismatch-graphs with the same choice of d. 

Furthermore, it can be seen in Fig. ^ that the scaling law (ff6|) even well applies to very 
small clusters. It is useful to define a cluster size dependent exponent t\ C \ by 

i n p(\c\+i) 

_ P (\c\) n7 , 

|C| 

Evaluating the perimeter polynomials (P[)-([T3[), values of t\c\ for small |C| can be derived. 
Thus we obtain a sequence {t\c\} which should approach the true value of r for large \C\. 

As a matter of fact ( |16D changes to p(|C|) ~ \C\~ T in the limit of infinite systems. 
Performing the limits d — > oo in Eqs. (|9|)-([T3|) and (|17|), we observe that r,q obeys 

^ - HT^y - ici + 1 , (is) 



which by comparison of fllTD with (18) and solving the recursion relation implies the law 



-\C\\ C \\C\-2 

P(\ C \) = -^ ] ( 19 ) 

for the cluster size distribution. Equation ( ji~8f) leads to r = 3/2 for large clusters. The thus 
computed value r = 3/2 is in accord with the exact result on the Bethe-lattice. 

Since we investigate a model for a biological system our main interest is devoted to 
large, yet finite systems. In the sequence {t|c|} our best approximation for the real value of 



S 



t is T4. A comparison between values of r computed by r rs T4 and exponents r obtained 
by evaluating numerical data for small systems suggests that the involved approximation 
becomes rapidly more accurate if the system size is increased. Hence in d = 32 we rely on 
r 4 and find r ps 1.5, which gives a result that also supports the previous assumption. 

Subsequently it was our aim to find a quantity which gives an overall estimation of 
deviations between random graphs on one-mismatch-graphs from those on Bethe-lattices 
of equal coordination number. For this purpose it appears appropriate to investigate the 
results of an edge elimination procedure which computes the number of bonds belonging to 
loops. Similar procedures have very recently |27J been applied to lattices. 

The above aim is achieved by individually removing every edge of every cluster and 
calculating the number of the connected components of the resulting graph. An edge is 
called fractioning if two components are obtained cutting this edge. Clearly, only edges that 
belong to loops are not fractioning. 

Then the ratio of fractioning bonds f{C) to the overall number of bonds b(C) of a 
cluster C, f(C)/b(C), is some indication for the importance of loops within the structure 
of C. Distinguishing clusters according to their size \C\ we computed the mean fractioning 
ratio 

/id =< f(C')/b(C) > [c > H c\ ■ (20) 

Except for very small clusters with trivial structure we expect a finite size scaling law 

fc\ = \C\- X F(\C\/\C*\) (21) 

for the fragmentation rate / at the percolation threshold, the validity of which is illustrated 
in Fig. [7[ 

Similar to r\ C \ a cluster size dependent exponent \\c\ may be defined. The only 4- and 
5-clusters containing non-fragmenting bonds are the 4-loop (no fractioning bond) and the 
4-loop with a tail (1/5 of all bonds fractioning) which give the contributions p 4_lo °P(4) and 
^tailed 4-ioo P ( 5 ) = ^/2 p 5 q 5d - 7 (d - l)d(d + 1) to p(4) and p(5), respectively. Making use of (pD 
and flT3P we then find 

_ p (4)-p 4 - 1 °°P(4) _ 1 

h ~ M ~ i+ 1 - (22) 

P[ - > + 4(d-l)(l+l/(3 9 )) 

and 

p(4) — p tailed 4-loop^ j ^tailed 4-loop ^ ^ j 

h — 77^ 1" 7 77^ = 1 — 



p(4) 5 p(4) 51 + l/g+(c/-2)(l + g + l/(12g 2 )) ' 

(23) 

. Finally, inserting p fa p^ = d~ x + 0(d~ 2 ) a rough estimate for A is A4 = (ln/5 — 
ln/4)/ln(5/4) which gives 

> /a 1 ( 393 , 1 4029120 , 2 ^ /l3 A 

x ' {d) " R574) {^W d + Twr d + 0{d » ) ■ (24) 

For d — 10 we have A ~ 0.11 in very good agreement with the numerically obtained value 
0.12, cf. Fig. 0. For d = 32 we find A ~ 0.03 already very close to A = which holds for 
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Bethe-lattices. Thus the value A > measured on finite dimensional random graphs on one- 
mismatch base-graphs - which is caused by a small number of loops - should quantify the 
deviation from random graphs on Bethe-lattices. It appears that the occupation probability 
at the percolation threshold is still small enough to apply the Bethe approximation. 

Yet, random graphs at p c are not exactly Bethe-like and contain a certain fraction 
of loops. Otherwise it would be impossible to distinguish subclusters according to their 
connectivity within any considerable connected component. This, however, is likely to be 
necessary, to explain the role of the central part of idiotypic networks properly Jl3| . 



Further investigations concerning the structure of great clusters will be made in Sec. V. 



V. DENSITY THRESHOLD AND THE COMPLETENESS OF THE IDIOTYPIC 

NETWORK 

Hitherto, our main interest was devoted to the question at which occupation probability 
a great cluster starts to exist. If p, however, is further increased it is also imaginable that a 
situation occurs, where the whole system consists of one great cluster only. This is referred 
to as the question about the connectivity property of random graphs. 

In the theory of random graphs some general results that address related problems have 
been derived for so called sequences of configuration spaces p^j23| and earlier for a different 



general class of random graphs in p0|,2l|. It can be shown that the sequence of one-mismatch- 



base graphs {G^} fulfills all requirements of these configurations spaces [ft^l , due to its rather 
technical nature the proof will be omitted here. 

General results of can be applied here showing that for infinite systems there 

exists a threshold 

p conn = l- lim IGlr 1 ^ (25) 

d^oo 

(7d being the coordination number) with the property that almost all random graphs are 
connected for p > p conn while the set of all connected random graphs has measure zero for 

P *^ Pconn- 

Moreover, for sequences of configuration spaces it holds that both the connectivity and 
the density treshold, i.e. the threshold for the property that there is no non-occupied site 
without occupied neighbours on the base graph, coincide. Using (|2~5"[) we find 



P2L = p£L = 1/2 (26) 

in case of one-mismatch-graphs and 

Pconn = Pdense = (27) 

for two-mismatch graphs. 

Density of random graphs in our model can be directly translated into biological terms. 
Since antibodies and antigens are represented by the same sets of bit chains the property 
that there is no free site without occupied neighbour means that every antigen is sure to 
encounter a complementary antibody. Hence the idiotypic network is able to respond to any 
antigen. This is meant by the completeness axiom M\ for the immune system. 
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Nevertheless it is somewhat difficult to reconcile the demands for denseness of random 
graphs and occurance of small clusters at the same time. However, it seems unlikely that 
completeness should be understood in this strict way. Rather it appears to be a better 
solution to consider the fact, that evolution has most likely driven the idiotypic network 
into such a kind of arrangement, that it is able to respond to variations of really existing 
antigen only. Thus we argue that completeness of the idiotypic network does not exactly 
match density of random graphs, but requires only the probability for the density of the cor- 
responding random graph to be somewhat below 1. So, random graphs could still comprise 
small clusters and be complete. 

Another paradoxon arises if two-mismatch graphs are considered. For those the connec- 

(2) (2) 

tivity and percolation thresholds p c and pconn fall together. How could then small clusters 
and a seperate great component coexist if all random graphs are connected? Yet, for the 
case of finite systems it is clear that p^(d) < Pconn(rf), i-e. a great cluster has to be formed 
first, before it can devour his competing small rivals. 

Consequently, also for the case of two- mismatch-graphs the biologically interesting regime 

fty\ 

is well defined. For finite systems there is a range of occupation probabilities p c (a) < p < 

(2) (2) 

Pconn(rf), where all requirements are met. Due to the fact that both probabilities p c (a) and 

(2) 

Pconn(rf) are converging to zero in the limit d — > oo it follows that the extent of this range 

(2) (2\ 

\pc (d) — Pconn{d)\ will also become very small for large systems. 

In the next Section our interest will be shifted towards the intrinsic structure of great 
clusters. Therefrom further conclusions about biologically relevant parameter regimes can 
be drawn. 



Resuming results for percolation on mismatch-graphs from Sec. I-IV two phase transi- 
tions have been found to occur. The ranges below p c (since there is no great cluster) and 
above p C onn (since there is only one great cluster) are of no interest concerning the biologi- 
cal background of the model. The one great cluster-many small clusters situation between 
both, however, is likely to fulfill some requirements for idiotypic networks (see Sec. II). In 
this Section further investigations into the network structure within this parameter regime 
will be made. 

Insights into the structure of a great cluster can be obtained from the mass distribution 
of this cluster M c (s) = {v £ C\d(v,c) = s}, i.e. the information, how many vertices have 
a certain distance s from a starting point c. Due to the equality of all starting points c we 
define 



and consider the mean value of M(s) calculated over all clusters, whose size exceeds a 
minimum value 0.5 x p x 2 d . For the metrics d(- , •) there are two distinguished choices, viz. 
metrics defined by (§) allowing paths in G = G\, i.e. d(- , ■) = d G , and such restricting paths 
to C itself, i.e. d(- , •) = dc- Since d c > da we confine our investigations to d{- , •) = dc 
which provides a better 'resolution'. In the following we will discuss some typical cluster 
compositions to obtain a survey of possible conclusions from mass distributions to cluster 
structures. 



VI. INTRINSIC STRUCTURE OF GREAT CLUSTERS 




(28) 
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Basically, three different situations could be imagined (for (i) and (iii) see Fig. §). 

(i) A cluster could consist of some loosely clinged high connectivity regions (heaps), whose 
vertices are distinguished by a great number of connections with each other. Other vertices 
are bound with relatively few links. Clearly, strong connectivity within heaps means, that 
vertices belonging to them have nearly the same distance from all other vertices. Thus, the 
presence of many heaps should result in a great number of local extrema, caused by 'looking 
from all heaps' to each other. On the other hand, loosely bound vertices will smooth the 
mass distribution, i.e. reduce the sharpness of extrema. 

(ii) A cluster could contain no distinguished parts at all. From (]£]) we know that till 
s = -D max the number of vertices is increasing with the distance s. For small occupation 
probabilities p < p CO mi (for which not the whole base-graph has been 'conquered' yet) we 
expect a compromise between with s declining probabilities that a vertex of that cluster has 
distance s from an origin v and the increasing number of vertices with s. Consequently, the 
mass distribution should exhibit one maximum. For large p > p corm mass distributions can 
simply be derived by multiplying (§) with p. 

(iii) As a special case of (i) a cluster could be made of one heap and a certain fraction of 
loosely bound vertices. Then up to two maxima, caused by the heaped vertex concentration 
and the competing tendencies (see (ii)), respectively, are likely to occur. The sharpness of 
both maxima will depend on the fraction of the loosely bound vertices. Thus, in case of 
a large proportion of those vertices both extrema could be smoothened to just one broad 
maximum. 

Fig. |] shows simulation results for the normalized mass distribution for different val- 
ues of p. Clearly, all distributions are marked by only one maximum, whose sharpness is 
increasing with growing values of p. In the vicinity of the density threshold clusters are 
already stretched over the whole extent of the base graph. Hence, for p = 0.4 slightly below 
Pcorm = 0.5 the curve of the mass distribution looks similar to the exactly known distribution 
for the fully occupied base-graph given by Eq. (|B|), cf. also Fig. [| 

More interestingly, for small p's in the vicinity of the percolation threshold relatively 
broad maxima occur, which could be an indication for cluster structures as described in 
(iii). 

To prove the validity of this hypothesis we have applied the edge-elimination algorithm 
(see Sec. II) to great clusters. As a slight extension of the described procedure all fractioning 
edges are removed, thus splitting a cluster C into the sequence of its doubly connected 
components, or heaps in the above sense, (Ci, C t ). Obviously, vertices belonging to 
doubly connected components are distinguished by their strong cliquishness in comparison 
to other vertices. Thus, the number of such components t should allow to distinguish between 
the situations (i)-(iii). 



Fig. [T(] shows simulation data for the mean value (t) of the resulting non-trivial parts 



after edge-elimination depending on p. The distribution is marked by one maximum, which 
is again a consequence of two competing tendencies. For small values of p clusters are 
generally not doubly connected, but increasing p leads to a larger proportion of vertices that 
belong to loops. On the other hand, there is a tendency that loops get connected by loops, 
i.e., seperate doubly connected cluster components are growing together. Thus, for large p 
almost every cluster will consist of one doubly connected component only. 

Results of Fig. [TO show also that the maximum is reached slighly above the percolation 
threshold p c . Then, the number of doubly connected parts t is rapidly declining till it 
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asymptotically approaches t = 1 for p — > 1. We argue that this behaviour is caused by 
one great doubly connected component which occurs first for some p%p c and then gradually 
incorporates all other doubly connected parts. In principle, these results do also apply to 
two- mismatch graphs (see Fig. |ll|). 

Comparing this with the above described scenario of cluster structures, we can thus 
state that there is a range of values of p^p c where (iii) applies, i.e. clusters are made of one 
great doubly connected component (including several small ones) and a set of other loosley 
bound vertices. Accordingly, within this range of p we define two subsets of great clusters 
C, namely the great doubly connected component B = and the complementary set 
P = C- B. 

As already discussed in Sec. I, the central part of idiotypic networks (corresponding to 
great clusters C) should contain strongly and weakly connected distinguished subsets. From 
the previous analysis it becomes clear, that there is a choice of the only parameter p, where 
bit-string models can exactly reproduce such a situation. 

VII. CONCLUSIONS 

We investigated statistical properties of a bit-string based model for idiotypic networks 
and compared typical architectures of the thus defined network with axioms and hypotheses 
for idiotypic networks from theoretical biology. Before introducing randomness we undertook 
an analysis of the underlying base graphs to show a major difference to standard percolation 
problems which bases on the way how the size of the system is increased. 

In the following the expressed antibody repertoire was identified with graphs, created 
by randomly occupying a matching rule defined base structure (base graphs). Concepts of 
percolation theory have been applied in order to determine the percolation threshold. 

The immune system is a very large, yet finite system. Consequently, finite size corrections 
have to be taken in account. Series expansion techniques allowed the calculation of two 
critical exponents (for finite systems and in the limit d — > oo) that characterize the scaling 
behaviour. A regime of values for the parameter p > p c has been found where random 
graphs consist of many small clusters and one great connected component. For choices p^Lp c 
our model reproduces the peripheral/central part conception for idiotypic networks. 

The translation of the notation of the completeness of idiotypic networks into the lan- 
guage of graph theory allowed the determination of an upper threshold Pdense for the oc- 
curance of the many small clusters-one great cluster situation. Furthermore, relying on 
general results of the theory of random graphs (for so-called sequences of configuration 
spaces) we calculated the - in this case coinciding - thresholds for the density (pdense) and 
the connectivity {p corm ) properties. 

Furthermore, we developed techniques to obtain additional information about the struc- 
ture of great clusters. Analysing random graphs for p%p c great clusters can be decomposed 
into two subsets of vertices with different binding properties, namely groups of doubly con- 
nected vertices (backbone) and loosely linked vertices (peripheral part of the great cluster). 

Thus, bit-string models are suited to describe a hierarchy of connectivity levels, that 
really existing idiotypic networks are also expected to exhibit. Our results support to some 
extent idealized architectures used in a mean field type model to describe the dynamics of 
the central part of the immune system |TB[. Contrary to other model approaches [P|,P£|-|3"T|| 
for topologies of idiotypic networks our simple few parametric bit-string model produces 
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a non-trivial seemingly realistic network topology, without assuming a priori distinguished 
vertex groups. 



APPENDIX A: RENORMALIZATION GROUP APPROACH 

In this appendix we want to present a method to approximate the connectivity threshold 
for one-mismatch-graphs. The extension of ideas of the renormalization group theory could 
also be applicable to more complicated cases which do not allow an exact treatment. 

Our approach bases on the idea of the renormalization of small cells |33| which employs 
the self similarity of the system on different length scales at the percolation threshold. A 
condition to apply this procedure is that an appropriate grouping of sites on the original 
lattice leads to a renormalized lattice with the same symmetry properties as the original 
one. Here, treating not a real space lattice but functional networks, we adapt the idea of 
the renormalization group theory in the following way. We find a transformation 1Z which, 
by grouping of vertices, leads from the one-mismatch base graph G d+2 in dimension d + 2 to 
a graph 1Z(G l d+2 ) which is equivalent to the base graph G\ in dimension d, 1Z{G l d+2 ) — G d , 
thus allowing a systematic reduction of the degrees of freedom. 

If a - for finite systems appropriately defined - threshold property, namely p CO nn(d), 
converges to a certain value for d — > oo, differences between p^ and Pd-2 must be small and 
will disappear in the limit of infinite systems. Consequently, if a grouping of vertices to 
super vertices on a graph G yields a renormalized graph 7Z(G) of the same type, percolation 
thresholds on both graphs will be the same. Thus, we replace the term symmetry (as it is 
applied to lattices) by equivalency of graphs. 

We encode an arbitrary vertex of the base graph G d+2 by (A,b ,bi) where A is a bit 
chain of length d and 60, &i ar e single bits. Every vertex on G d+2 belongs to a unique 
4-loop {(A, b , 61), (A, bo, bi), (A, b , b{), (A, b , bi)} connected by one-mismatch links. The 
renormalization 71 replaces this 4-loop by the 'super'vertex (A) on Tt{G d+2 ) (of course the 
choice of (A) leads to identical results). If two vertices A and B are connected by an inversion 
(one-mismatch) link on G d the vertices of the corresponding 4-loops on G l d+2 are connected 
by 4 inversion (one-mismatch) links, too, cf. Fig. |12[ 

We thus define a renormalized graph TZ(G d+2 ) composed of the above explained vertices 
and edges. It follows directly from our construction that Tt{G l d+2 ) is equivalent to G d . 

To describe random graphs we apply the following majority rule: a super vertex on G l d 
is considered as occupied if at least two connected vertices of the corresponding 4-loop on 
G l d+2 are occupied, i.e. this 4-loop is said to percolate. On the basis of this rule we obtain 

p' = 4p 2 (l-p) 2 + 4p 3 (l-p)+p 4 , (Al) 

where p' denotes the (renormalized) probability that vertices of (G) are occupied if vertices 
on G were diced with probability p. 

Calculating fixed points of eq. ( |A1[) we obtain p* = 0, (3l\/5)/2, and 1. Thus, as the 
only unstable solution in [0, 1] we have p* = (3 — y/E)/2 as a first approximation for the 
connectivity threshold. 

At this place one could pose the question whether only 4-loops are suited as a renormal- 
ization cell. Indeed, it is also possible to summarize successive renormalization steps to a 
single great one, ordinary hypercubes of even dimension k < d form suitable cells as well. 
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On the other hand, condensing odd dimensional hypercubes to super vertices does not lead 
to renormalized graphs 1Z{G) that are equivalent to other base graphs of the sequence {G\}. 
Altogether, this seems to be an effect of differences between even and odd- dimensional base 
graphs (see Sec. II). 

Of course, renormalizing small cells entails some approximation. Grouping together 
vertices and applying a majority rule to occupy the renormalized vertices, situations can 
arise where clusters on the original lattice are cut or new clusters are formed 

There are different approaches to improve the involved approximations. One possibility 
suggested in which leads to exact results in the limit of very large cells is to summarize 



some elementary cells to one large cell of size z. This large cell will then be occupied if all 
elementary cells are occupied and connected, i.e. are said to percolate. Since 'renormaliza- 
tion faults' are essentially due to cell surface effects improvements produced by the above 
method emanate from the declining surface-volume ratio of large cells. For one-mismatch 
graphs it holds, however, that the surface size of a cell s(z) depends logarithmically on z, 
viz. s(z) ~ d + 1 — log 2 z leading to only slight improvements with increasing z. 
Using larger elementary cells consisting of two coupled 4-loops we obtain 

[p'f =p 8 + 8p 7 q + 24pV + 32pV + 12p 4 q 4 , (A2) 

using four coupled 4-loops yields 

(j/) 4 = p 16 + lQp 15 q + \\2p l4 q 2 + U8p 13 q 3 + 1120p 12 q 4 + 

+ 1792p n g 5 + 1776p 10 g 6 + 1008pV + 180pV (A3) 



where q = 1 — p. As fixed points p' — p of (|A2[) and (|A3|) we determined numerically 



p 7H 0.41 and p ~ 0.39, respectively. The difference of both values to p CO nn = 0.5 may be a 
consequence of the above mentioned slow convergence. 

More interestingly, our renormalization procedure of replacing hypercubes of dimension 
k by super vertices is without modifications applicable to ordinary hypercubes as well. This 
gives an additional argument that the connectivity thresholds of both sequences of graphs 
are equal which can also be verified by evaluating eq. ([25]). 
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FIGURES 





FIG. 1. Comparison of a hypercube (left side) with the one-mismatch base graph (right side) 
which have the same set of vertices (d = 3). Solid lines connect perfectly complementary vertices, 
dashed lines mean one-mismatch links. 
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FIG. 2. One-mismatch base graph in d = 4. Same notation as in Fig. [l]. 
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FIG. 3. Normalized mass in the dimensionless distance s from an arbitrary origin in even 
(d = 34) and odd (d = 35) dimension. Points have been connected to guide the eyes. 
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FIG. 4. Cluster size distribution depending on the occupation probability p = 0.1,0.2,0.3 for 
base graphs in 8 dimensions. The histogram was taken over 10 5 configurations. 
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FIG. 5. Cluster size distribution on G\ and G{ 4 at the percolation thresholds. The simulation 
data are plotted on doubly logarithmic scales. 
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FIG. 6. Illustration of finite size scaling laws for the two-mismatch-graphs G\ G and G\ 2 - Data 
sets are plotted on doubly logarithmic scales. 
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FIG. 7. Simulation results for the fragmentation rate / on G\ depending on the cluster size \C\ 
at the numerically determined p c ~ 0.112. Data are plotted on doubly logarithmic scales. Fitting 
a power law (dashed line) yields A ~ 0.12 to be compared with A ~ 0.11 from Eq. (p4|). 
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FIG. 8. Visualization of different cluster structures. The circles symbolize strongly connected 
subclusters (heaps). Left hand side: the cluster consists of several weakly connected heaps. Right 
hand side: The cluster contains one heap only. 
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FIG. 9. Simulation results for the normalized mass distribution on G\ for varying probabilities 
p = 0.1, 0.15, 0.2, 0.3, 0.4 and the results of Eq. (|6|) for p = 1.0 (In order to make results for different 
p's comparable on the same scale masses have been divided by the cluster size |C|). Points have 
been connected to guide the eyes. 
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FIG. 10. Mean value (t) of the resulting great fragments after application of the 
edge-elimination procedure for G\ depending on the occupation probability p. 
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FIG. 11. Mean value (t) of the resulting great fragments for the two-mismatch-graphs Gg, G§ 
and G\q depending on p. Except a shift towards smaller values of p the results are similar to those 
on one-mismatch-graphs 
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FIG. 12. Illustration of the renormalization procedure applied to a pair of connected 4- loops on 
G\ +2 leading to a pair of linked super vertices on G\. If du(B, A) = the dotted edges correspond 
to inversion links, if dn{B,A) = 1 to one-mismatch links. 
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